COVID-19 is affecting 196 countries and territories around the world, and the number of deaths
keeps increasing each day. According to the World Health Organization (WHO), the number of people
infected with COVID-19 is increasing day by day and has now reached 570,000. WHO prefers to conduct
COVID-19 screening tests via an online system. A suitable approach, especially for string matching
based on symptoms, is required to produce fast and accurate results during the retrieval process.
Currently, four recent approaches to string matching have been implemented: the character-based
algorithm, the hashing algorithm, the suffix automaton algorithm, and the hybrid algorithm.
Meanwhile, extensible markup language (XML), JavaScript object notation (JSON), asynchronous
JavaScript and XML (AJAX), and jQuery technology have been widely used for data transmission, data
storage, and data retrieval. This paper proposes a combination of the hybrid algorithm, JSON, and
jQuery to produce fast and accurate results during the COVID-19 screening process. A few
experiments were conducted to compare performance in terms of execution time and memory usage
using five different collections of datasets. Based on the experiments, the results show that the
hybrid algorithm combined with JSON and jQuery performs better than JSON and jQuery alone. Online
COVID-19 screening can hopefully reduce the number of infections and deaths caused by COVID-19.

Keywords: AJAX; Coronavirus; COVID-19 symptoms; Hybrid algorithm; jQuery; JSON

1. INTRODUCTION

COVID-19 is a disease caused by a virus called SARS-CoV-2. This virus appears to have first
emerged in Wuhan, China [1]. SARS-CoV-2 is considered a novel human-infecting betacoronavirus [2].
It is a member of the family Coronaviridae and the order Nidovirales [3]. This family consists of
two subfamilies: Coronavirinae and Torovirinae. The subfamily Coronavirinae is divided into four
genera: (i) Alphacoronavirus, which contains the human coronavirus 229E (HCoV-229E) and human
coronavirus NL63 (HCoV-NL63); (ii) Betacoronavirus, which includes human coronavirus OC43
(HCoV-OC43), severe acute respiratory syndrome human coronavirus (SARS-HCoV), human coronavirus
HKU1 (HCoV-HKU1), and Middle East respiratory syndrome coronavirus (MERS-CoV);
(iii) Gammacoronavirus, which includes viruses of whales and birds; and (iv) Deltacoronavirus,
which includes viruses isolated from pigs and birds [4]. SARS-CoV-2 is an enveloped,
positive-sense single-stranded RNA (+ssRNA) virus.

COVID-19 has become a worldwide threat, having caused the deaths of 26,495 individuals and
infected more than 570,000 [5]. People with a few symptoms such as fever, dry cough, and
tiredness, even in mild cases, may have the virus [1]. In several cases, people with the virus
develop difficulty breathing and may ultimately experience organ failure. According to the World
Health Organization (WHO), most people infected with COVID-19 recover without requiring special
treatment [6]. However, older people and those with underlying medical problems such as
cardiovascular disease, diabetes, chronic respiratory disease, and cancer are more likely to
develop serious illness.

The COVID-19 virus spreads primarily through droplets of saliva or discharge from the nose when an
infected person coughs or sneezes. The best way to prevent or slow down transmission is to protect
yourself and others by washing your hands or using an alcohol-based rub frequently and not
touching your face [1]. The number of people infected with COVID-19 is increasing day by day and
has now reached 570,000. To manage this amount of data, a suitable online system is needed that
avoids delays during execution. This study proposes an efficient approach to storing and
retrieving data that can handle a large amount of data under concurrent access.


2. RESEARCH METHOD 

Symptoms of COVID-19 differ among individuals, ranging from asymptomatic infection to severe
respiratory failure [7]. According to an Italian population study conducted in the town of Vo
Euganeo by Dr. Lavezzo and colleagues, around 50-70% of individuals with positive reverse
transcription polymerase chain reaction (RT-PCR) throat swab results remain asymptomatic, and a
further small percentage (about 10% of all symptomatic patients) present dyspnoea, severe
interstitial pneumonia, acute respiratory distress syndrome (ARDS), and multi-organ dysfunction.
Meanwhile, the vast majority of individuals with symptoms had one or more coexisting medical
conditions such as hypertension, diabetes, and cardiovascular disorders, with elevated case
fatality amongst elderly and frail patients [8], [9]. Other common symptoms of the disease include
fever, cough, fatigue, slight dyspnoea, sore throat, headache, and conjunctivitis. As a result, it
is difficult to differentiate COVID-19 from other respiratory diseases [10]-[12]. Meanwhile,
gastrointestinal involvement such as diarrhoea, nausea, and vomiting is seen in a lower percentage
of cases. Table 1 summarises the symptoms observed in various cohorts of patients. Based on
Table 1, fever and cough account for the highest percentages, between 44% and 98%. Dyspnoea and
sputum follow, with percentages between 14% and 64%. Other symptoms such as sore throat, headache,
diarrhoea, nausea/vomiting, and myalgia each account for 15% or less.


Table 1. Symptoms observed in various cohorts of patients
Symptom          Guan et al. [13]  Chen et al. [14]  Shi et al. [15]  Huang et al. [16]  Yang et al. [17]
Patients         1081              99                21               41                 52
Fever            473 (44%)         82 (83%)          18 (86%)         40 (98%)           46 (89%)
Dyspnoea         205 (19%)         31 (31%)          9 (43%)          22 (43%)           33 (64%)
Cough            745 (69%)         81 (82%)          15 (71%)         31 (76%)           40 (77%)
Sputum           370 (34%)         -                 3 (14%)          11 (27%)           -
Rhinorrhoea      53 (5%)           4 (4%)            5 (24%)          -                  3 (6%)
Sore throat      153 (14%)         5 (5%)            -                -                  -
Headache         150 (14%)         8 (8%)            2 (10%)          2 (5%)             3 (6%)
Diarrhoea        42 (4%)           2 (2%)            1 (5%)           1 (2%)             -
Nausea/vomiting  55 (5%)           1 (1%)            2 (10%)          -                  2 (4%)
Myalgia          164 (15%)         11 (11%)          -                -                  6 (12%)


3. MATCHING ALGORITHM, AJAX AND JQUERY TECHNOLOGY 

The motivation of this study is to develop an application for online COVID-19 screening based on
a suitable architecture. The architecture must be able to manage multiple users accessing the
system concurrently and reporting the symptoms in Table 1. The increasing number of people
infected with COVID-19 requires fast data transmission between client and server. In this paper,
four different approaches to string matching have been identified: i) the character-based
approach, ii) the hashing-based approach, iii) the suffix automaton-based approach, and iv) the
hybrid approach. Meanwhile, AJAX and jQuery are the most relevant technologies for fast data
transmission between the client and server sides.

One of the traditional approaches to string matching problems is the character-based approach
[18]. In this approach, each character is compared individually from left to right, at the cost of
extra time. Two stages are involved: the searching phase and the shift phase. In the reported
experiments, results on corpora of Arabic, English, Chinese, Italian, and French texts are
promising. However, the time complexity of the algorithm may increase when searching for multiple
patterns in a given text.
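
The searching and shift phases described above can be illustrated with a minimal sketch. This is the plain naive matcher, not the specific algorithm of [18]; the function name `naiveSearch` and the sample strings are assumptions made for illustration.

```javascript
// Naive character-based matching (illustrative sketch, not the algorithm of [18]).
// Returns every index at which `pattern` occurs in `text`.
function naiveSearch(text, pattern) {
  const matches = [];
  const n = text.length;
  const m = pattern.length;
  if (m === 0 || m > n) return matches;
  // Shift phase: slide the pattern one position at a time.
  for (let s = 0; s <= n - m; s++) {
    // Searching phase: compare characters from left to right.
    let j = 0;
    while (j < m && text[s + j] === pattern[j]) j++;
    if (j === m) matches.push(s);
  }
  return matches;
}

console.log(naiveSearch("fever, dry cough, fatigue", "cough"));
```

Every mismatch moves the pattern by a single position, which is why the cost grows quickly for long texts or many patterns.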

Another approach to string matching is hashing [19]. Hash values are used in the matching
process, and comparison proceeds from left to right. For instance, given a pattern p of m
characters, its hash value is calculated first. The hash value is then used for matching between
a given text t and the pattern p. After the hash function is applied, the searching phase uses
character comparison. The algorithm therefore involves two key steps: pre-processing and
searching. In the pre-processing phase, a string of c characters is converted into a string of d
decimal numbers. The value of pattern p is divided by a pre-defined prime number q, and the
modulus operation is then used to calculate the remainder of p with respect to q. For each shift s
(ranging from s = 0 to n - m, where n is the length of the text), the remainders of the pattern
and of the current text window are compared. Once a match is found, a brute-force comparison is
performed to verify the result. This approach saves a large amount of computation because it
compares integer values instead of characters. However, its limitation is hash collision, which
occurs when two different strings map to the same number.
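
The pre-processing and searching steps above correspond closely to the classic Rabin-Karp scheme. The following is a minimal sketch under that assumption; the function name `rabinKarpSearch`, the base `d = 256`, and the prime `q = 101` are illustrative choices, not values taken from [19].

```javascript
// Rabin-Karp style matching (illustrative sketch; d and q are assumed values).
function rabinKarpSearch(text, pattern, d = 256, q = 101) {
  const n = text.length;
  const m = pattern.length;
  const matches = [];
  if (m === 0 || m > n) return matches;

  // h = d^(m-1) mod q, used to remove the leading character from the window.
  let h = 1;
  for (let i = 0; i < m - 1; i++) h = (h * d) % q;

  // Pre-processing: remainders of the pattern and of the first text window.
  let patternHash = 0;
  let windowHash = 0;
  for (let i = 0; i < m; i++) {
    patternHash = (d * patternHash + pattern.charCodeAt(i)) % q;
    windowHash = (d * windowHash + text.charCodeAt(i)) % q;
  }

  // Searching: compare remainders for each shift s = 0 .. n - m.
  for (let s = 0; s <= n - m; s++) {
    if (patternHash === windowHash) {
      // Brute-force verification guards against hash collisions.
      let j = 0;
      while (j < m && text[s + j] === pattern[j]) j++;
      if (j === m) matches.push(s);
    }
    if (s < n - m) {
      // Rolling update: drop text[s], append text[s + m].
      windowHash = (d * (windowHash - text.charCodeAt(s) * h) + text.charCodeAt(s + m)) % q;
      if (windowHash < 0) windowHash += q;
    }
  }
  return matches;
}

console.log(rabinKarpSearch("dry cough and fever, cough", "cough"));
```

The verification loop runs only when the two remainders agree, so most shifts cost a single integer comparison.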

The suffix automaton approach comprises two related but distinct automata constructions: the
deterministic acyclic finite state automaton and the suffix automaton used for matching [20]. The
automaton can be defined as $D(p) = \{Q, q_0, F, \Sigma, \delta\}$. Here,
$Q = \{q_1, q_2, q_3, \ldots, q_m\}$ is the set of states, $F = \{q_m\}$ is the set of accepting
states, $\Sigma$ is the alphabet, and $\delta : Q \times \Sigma \to Q$ is the transition function.
This approach uses a directed acyclic graph in which nodes (vertices) are called states and edges
between nodes are transitions between the states. One of the states, denoted by $q_0$, is called
the initial state of the suffix automaton, from which all other states can be reached. For
instance, take the input pattern "abbabb" and denote the initial state $q_0$ by 0. Every path
from state 0 to a terminal node must spell a suffix of the pattern; in this case, the possible
proper suffixes of "abbabb" are "b", "bb", "abb", "babb", and "bbabb". The basic idea of this
approach is that the text t is scanned from left to right, and the algorithm decides how far the
pattern p is shifted so as to avoid redundant comparisons after a mismatch.
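
To make the "abbabb" example concrete, the sketch below builds a suffix automaton with the standard online construction and checks which strings it accepts as suffixes. It is an illustration only and is not the matcher of [20]; the names `buildSuffixAutomaton` and `isSuffix` are assumptions.

```javascript
// Standard online construction of a suffix automaton (illustrative sketch).
function buildSuffixAutomaton(pattern) {
  const newState = (len, link, next) => ({ len, link, next, terminal: false });
  const states = [newState(0, -1, new Map())]; // state 0 is the initial state q0
  let last = 0;
  for (const ch of pattern) {
    const cur = states.length;
    states.push(newState(states[last].len + 1, -1, new Map()));
    let p = last;
    while (p !== -1 && !states[p].next.has(ch)) {
      states[p].next.set(ch, cur);
      p = states[p].link;
    }
    if (p === -1) {
      states[cur].link = 0;
    } else {
      const q = states[p].next.get(ch);
      if (states[p].len + 1 === states[q].len) {
        states[cur].link = q;
      } else {
        // Split state q by cloning it with a shorter length.
        const clone = states.length;
        states.push(newState(states[p].len + 1, states[q].link, new Map(states[q].next)));
        while (p !== -1 && states[p].next.get(ch) === q) {
          states[p].next.set(ch, clone);
          p = states[p].link;
        }
        states[q].link = clone;
        states[cur].link = clone;
      }
    }
    last = cur;
  }
  // Terminal states are those reachable from `last` through suffix links.
  for (let p = last; p !== -1; p = states[p].link) states[p].terminal = true;
  return states;
}

// A word is a suffix of the pattern if its path from state 0 ends in a terminal state.
function isSuffix(states, word) {
  let s = 0;
  for (const ch of word) {
    if (!states[s].next.has(ch)) return false;
    s = states[s].next.get(ch);
  }
  return states[s].terminal;
}

const automaton = buildSuffixAutomaton("abbabb");
for (const w of ["b", "bb", "abb", "babb", "bbabb", "ab"]) {
  console.log(w, isSuffix(automaton, w));
}
```

The five proper suffixes listed in the text are accepted, while a substring such as "ab" that does not end the pattern is rejected.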

The hybrid algorithm combines the advantages of different algorithms and performs better than
the individual algorithms [21], [22]. In this approach, one or more characters are combined and
placed under the automaton-based approach. This hybrid algorithm is a modified version of the
Horspool algorithm [23] with additional string-matching conditions for scanning the text (from
left to right) and matching the string pattern (from right to left). The algorithm has two steps.
In the first step, the pivot character in the pattern is found by computing the character
distance and its maximum safe shift. In the second step, the pivot character is compared to the
characters of the text. If the pivot character matches a character in the text, the algorithm
starts matching the pattern against the text from the rightmost character until the end of the
text. If the matching fails, the hybrid Horspool shift is used. This algorithm has been shown to
achieve good performance for pattern matching on human proteins, natural language text (e.g.,
Arabic, Italian, French, and Chinese), and the E. coli genome.
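
Since the hybrid algorithm is described as a modification of Horspool [23], a sketch of the unmodified Horspool algorithm may help show the right-to-left comparison and the safe shift. The pivot-character step of the hybrid variant in [21], [22] is not reproduced here, and the function name `horspoolSearch` is an assumption.

```javascript
// Plain Horspool matching (illustrative sketch; not the hybrid variant itself).
function horspoolSearch(text, pattern) {
  const n = text.length;
  const m = pattern.length;
  const matches = [];
  if (m === 0 || m > n) return matches;

  // Shift table: distance from the last occurrence of each character
  // (excluding the final pattern position) to the end of the pattern.
  const shift = new Map();
  for (let i = 0; i < m - 1; i++) shift.set(pattern[i], m - 1 - i);

  let s = 0; // the text is scanned from left to right
  while (s <= n - m) {
    // The pattern is compared from right to left.
    let j = m - 1;
    while (j >= 0 && text[s + j] === pattern[j]) j--;
    if (j < 0) matches.push(s);
    // Safe shift determined by the text character under the pattern's last position.
    const lastChar = text[s + m - 1];
    s += shift.has(lastChar) ? shift.get(lastChar) : m;
  }
  return matches;
}

console.log(horspoolSearch("dry cough and fever, cough", "cough"));
```

When the character under the last pattern position does not occur in the pattern at all, the whole pattern length is skipped, which is where the speed-up over the naive matcher comes from.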

Asynchronous JavaScript and XML (AJAX) is a bundle of technologies used to simplify the
implementation of rich and dynamic web applications [24]. AJAX provides advanced and
sophisticated interactions between the application and its users. In contrast, the synchronous
request-response protocol used by traditional applications introduces a delay between one page
and the next. The features of the document object model (DOM) further increase the power of AJAX.

jQuery is a lightweight JavaScript library that emphasises interaction between JavaScript and
hypertext markup language (HTML) [25]. jQuery works much like AJAX, whose main characteristic is
to load data on a web page without reloading the entire page [26]. The advantages of jQuery and
AJAX include lower bandwidth usage and the separation of data, format, style, and function [25].

Extensible markup language (XML) and JavaScript object notation (JSON) are the most common data
models used by AJAX and jQuery. XML has been widely accepted as the relevant standard for
representing and exchanging data on the web [27]-[29]. Meanwhile, JSON is an alternative data
model to XML, preferred for its relative simplicity and compactness [30]. JSON offers
significantly better performance than XML, which requires extra libraries to retrieve data from
document object model (DOM) objects. JSON is estimated to parse up to one hundred times faster
than XML in modern browsers [29]. This study uses a matching algorithm to improve the performance
of data transmission between the client and server sides. The main task of the matching algorithm
is to calculate the possibility of COVID-19 infection based on the weight of each symptom.
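
As a small illustration of why JSON needs no extra libraries on the client, the sketch below parses a JSON payload with the built-in `JSON.parse`. The field names follow the symptom names used in this paper, but the sample values and the helper `countWithSymptom` are hypothetical.

```javascript
// Hypothetical payload: two screening records encoded as JSON text.
const payload =
  '[{"Age":"34","Gender":"F","Fever":"1","Cough":"1"},' +
  '{"Age":"61","Gender":"M","Fever":"1","Headache":"1"}]';

// Count the records that report a given symptom (helper name is an assumption).
function countWithSymptom(jsonText, symptom) {
  const records = JSON.parse(jsonText); // built-in parser, no DOM traversal needed
  return records.filter((record) => record[symptom] === "1").length;
}

console.log(countWithSymptom(payload, "Fever"));
console.log(countWithSymptom(payload, "Cough"));
```

The parsed result is already an array of plain objects, so symptom fields can be read directly by name.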

Based on the explanation above, the hybrid algorithm is chosen and will be integrated with AJAX
and jQuery for the searching and retrieval process. The process consists of four steps:
calculating a weight for each symptom, choosing a suitable technology for data transmission
between the client side and the server side, generating a data model, and analysing the
performance of the model. The next section describes our proposed method in detail.


4. METHODOLOGY 
In our methodology, the first step is to produce a weight for each symptom based on Table 1.
Table 2 presents the weight for each symptom. Each weight is calculated using (1).

$$\text{Average} = \frac{\text{Total percentage}}{\text{Number of symptoms}} \tag{1}$$


Table 2. Average for each symptom
Symptom          Guan et al. [13]  Chen et al. [14]  Shi et al. [15]  Huang et al. [16]  Yang et al. [17]  Average
Fever            0.44              0.83              0.86             0.86               0.89              0.776
Cough            0.69              0.82              0.71             0.71               0.77              0.740
Dyspnoea         0.19              0.31              0.43             0.43               0.64              0.400
Diarrhoea        0.04              0.02              0.05             0.50               0.00              0.305
Rhinorrhoea      0.05              0.04              0.24             0.24               0.06              0.126
Headache         0.14              0.08              0.10             0.10               0.06              0.096
Myalgia          0.15              0.11              0.00             0.00               0.12              0.076
Nausea/vomiting  0.05              0.01              0.10             0.10               0.04              0.060
Sore throat      0.14              0.05              0.00             0.00               0.00              0.038
Total                                                                                                      2.757
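
As a worked check of (1), the sketch below recomputes the average for three rows of Table 2. The Fever, Cough, and Dyspnoea averages listed in the table are reproduced when each row's sum is divided by the five cohorts, so that divisor is assumed here; the names `table2Rows` and `averageWeight` are illustrative.

```javascript
// Per-cohort fractions for three symptoms, copied from Table 2.
const table2Rows = {
  Fever: [0.44, 0.83, 0.86, 0.86, 0.89],
  Cough: [0.69, 0.82, 0.71, 0.71, 0.77],
  Dyspnoea: [0.19, 0.31, 0.43, 0.43, 0.64],
};

// Average of one row: sum of the cohort fractions divided by the number of cohorts
// (assumed divisor, chosen because it reproduces the listed averages).
function averageWeight(values) {
  const total = values.reduce((sum, value) => sum + value, 0);
  return total / values.length;
}

for (const [symptom, values] of Object.entries(table2Rows)) {
  console.log(symptom, averageWeight(values).toFixed(3));
}
```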


The weight for each symptom is used to calculate the percentage possibility based on input from
users. The second step is to design a suitable architecture for data transmission between the
client side and the server side. Figure 1 shows the two different architectures that will be
implemented for this data transmission. Three main elements are involved in this architecture:
the client-side engine, the data model, and the matching algorithm.


[Figure 1 panels: client side with "AJAX engine" (JavaScript) and "JQuery engine" (JavaScript);
data exchanged: XML/XHTML or JavaScript data; server side: matching algorithm, database, data
handling, etc.]

Figure 1. XML and JSON architecture 


4.1. Client-side engine 

The hybrid algorithm, AJAX, and jQuery are used for the client-side engine. Figures 2, 3, and 4
show the syntax of the hybrid algorithm, AJAX, and jQuery, respectively.
a) Hybrid algorithm

This algorithm is used to search and match based on the symptoms in Table 1.
b) AJAX

AJAX combines cascading style sheets (CSS), JavaScript, extensible hypertext markup language
(XHTML), and the document object model (DOM). There are three main elements in AJAX: the initial
declaration, the target uniform resource locator (URL), and the execution of the URL.
c) jQuery

jQuery events are a flexible and simple way of using JavaScript events. jQuery's application
programming interfaces (APIs) provide both wrapper methods for common JavaScript events and more
detailed APIs in the on(), off(), and trigger() methods. There are three main components in
jQuery: the URL or target destination, the method type (post or get), and the data.


    Assign pattern p
    Find the length of pattern p
    If mismatched (where 0 < i < k and ...)
        pattern is shifted by exactly (f - i + 1) or (k - j + 1) positions
        set position (0 < s < m or j)
    Else
        Repeat the searching process

Figure 2. Hybrid algorithm


    function fx() {
        // initial declaration of variable
        var x = new XMLHttpRequest();
        // target URL
        x.open(y, z, true);
        // execute URL
        x.send();
    }

Figure 3. AJAX syntax


    $(function () {
        $("#form").submit(function (e) {
            e.preventDefault();
            $.ajax({
                url: x,
                type: y,
                data: $("#form").serialize(),
                success: function (data) {
                    // message success/fail
                }
            });
        });
    });


Figure 4. JQuery syntax
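
The `data` component in the jQuery call is produced by `.serialize()`, which turns the form's fields into a URL-encoded string of name=value pairs. The sketch below shows the same kind of string built with the standard `URLSearchParams` class; the field names and values are hypothetical, since the form's actual field names are not given in the text.

```javascript
// Hypothetical form fields; the real form's field names are not stated in the paper.
const formFields = { gender: "F", contact: "no", fever: "1", cough: "1" };

// Build a URL-encoded name=value string of the kind sent as the body of a form POST.
function encodeForm(fields) {
  return new URLSearchParams(fields).toString();
}

console.log(encodeForm(formFields));
```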


4.2. Data model 

This section explains two different data models: XML and JSON. Figures 5 and 6 show the standard
data model structures for XML and JSON.
a) XML

XML is designed as an all-purpose format for organising data. In many cases, XML is used to
store data in a standardised way rather than creating a new format convention. Figure 5 shows
the standard format of an XML document. It has three main elements: root, child, and subchild.
The root represents the name of the object, the child represents the name of an attribute, and
the subchild represents the name of a sub-attribute.
Definition 1: Let an XML file be B = {R, C, H}, where R represents the object, C represents the
attribute name, and H represents the sub-attribute name.


    <root>
        <child>
            <subchild>...</subchild>
        </child>
    </root>


Figure 5. XML schema 


b) JSON 

A JSON file has a standard format, or schema. Figure 6 shows the standard format of a JSON file.
Three main elements are involved in the JSON file: title, type, and properties. Properties are
similar to attributes, and an object can have one or more of them. Based on Figure 6, $a_1$ to
$a_n$ are the attributes of each object.
Definition 2: Let a JSON file be A = {S, O, X}, where S represents the title, O represents the
object name, and X represents the properties or attributes.
Definition 3: Each O can contain more than one object, where $a_1$ to $a_n$ are the objects in
the JSON file.

$F(G) = O\{a_1, a_2, a_3, \ldots, a_n\}$

Definition 4: Each X can have one or more properties, where $b_1$ to $b_n$ are the properties
of X.

$F(H) = X\{b_1, b_2, b_3, \ldots, b_n\}$

According to Figure 6, three elements are involved in JSON: title, type, and properties. The
title represents the object name, the type represents the object type, and the properties
represent details about the object.


Figure 6. JSON schema


5. RESULTS
In this section, the performance of AJAX and jQuery using the matching algorithm is analysed in
terms of execution time and memory usage. We evaluate the performance of the system under
concurrent access. The systems were built using a personal computer equipped with a 2.40 GHz
Intel(R) Core(TM) i7-5500 CPU, 8.00 GB of memory, and a 250 GB solid-state drive. The operating
system is Microsoft Windows 10. We use the existing number of patients (1294) from Table 1 to
test the performance of AJAX and jQuery with the matching algorithm, using the form in Figure 7.


5.1. Modelling process

Four components are involved in the modelling process: the graphical user interface (GUI),
AJAX/jQuery, the hybrid algorithm, and the data model.
a) Graphical user interface (GUI). A simple GUI, shown in Figure 7, is designed to allow users
to choose any related symptoms before making a submission. The first process is submission. Once
the submission is done, the next process is AJAX/jQuery.
b) AJAX/jQuery. Figures 8 and 9 show a simple function using AJAX and jQuery. In AJAX, x is the
request variable, y is the method (post), and z is the URL/destination (send_data.php). For
jQuery, x is the target URL (send_data.php) and y is the method (post).
c) Hybrid algorithm. Figure 10 shows a program structure based on the hybrid algorithm in
Figure 2. In this program structure, the first step is the initial declaration of the symptoms
($s_1, s_2, \ldots, s_n$) and the list of data from users ($data_1, data_2, \ldots, data_n$). A
for loop is then used to calculate the possible percentage based on the matching symptoms and
their weights. Finally, the program displays the result (as a percentage) as a total.
d) Data model. This part covers the implementation of XML and JSON as a data model. Figures 11
and 12 show the data model in XML and JSON.
[Figure 7 content: bilingual (English/Malay) online form titled "COVID 19". It asks for the
user's gender, whether the user has had direct contact with a COVID-19 patient, and which of the
following symptoms apply: fever, dry cough, fatigue, diarrhoea, shortness of breath, headache,
sore throat, muscle or joint pain, sneezing, runny nose, or not related. The form states that the
symptoms are taken from research by the Department of Community Medicine, USM.]


Figure 7. GUI form based on COVID-19 symptoms 


    // target URL
    xhttp.open("POST", "send_data.php", true);

Figure 8. AJAX approach


    $("#form").submit(function (e) {
        // target URL: send_data.php
        ...
    });

Figure 9. JQuery approach


    $num = $n; // let $n equal the number of symptoms

    for ($j = 0; $j < $num; $j++) {
        // add the weighted value of every symptom the user reported
        if ($data[$j] != "") {
            $total = $total + $data[$j] * $s[$j];
        }
    }


Figure 10. Program code based on the matching algorithm (MA)
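
The loop in Figure 10 can also be sketched in JavaScript, the language of the client-side engine. The weights below are the Average column of Table 2. Expressing the result as a percentage of the sum of all listed weights is an assumption made for this illustration, as are the names `symptomWeights` and `screeningScore`.

```javascript
// Symptom weights copied from the Average column of Table 2.
const symptomWeights = {
  Fever: 0.776,
  Cough: 0.74,
  Dyspnoea: 0.4,
  Diarrhoea: 0.305,
  Rhinorrhoea: 0.126,
  Headache: 0.096,
  Myalgia: 0.076,
  "Nausea/vomiting": 0.06,
  "Sore throat": 0.038,
};

// Sum the weights of the selected symptoms. The percentage normalises by the
// sum of all listed weights, which is an assumed scoring rule for this sketch.
function screeningScore(selected, weights) {
  let total = 0;
  let maxTotal = 0;
  for (const [symptom, weight] of Object.entries(weights)) {
    maxTotal += weight;
    if (selected.includes(symptom)) total += weight;
  }
  return { total, percentage: (total / maxTotal) * 100 };
}

const result = screeningScore(["Fever", "Cough"], symptomWeights);
console.log(result.total.toFixed(3), result.percentage.toFixed(1) + "%");
```

Symptoms that the user did not select contribute nothing to the total, mirroring the empty-value check in Figure 10.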
    <List>
        <Age>...</Age>
        <Gender>...</Gender>
        <Contact>...</Contact>
        <Gender>...</Gender>
        <Fever>...</Fever>
        <Cough>...</Cough>
        <Myalgia>...</Myalgia>
    </List>


Figure 11. XML data model 


    [{"Age":"","Gender":"","Contact":"","Gender":"L","Fever":"","Cough":""},
     {"Myalgia":"","Age":"","Gender":"","Contact":"","Fever":"","Headache":""}, ...]


Figure 12. JSON data model 


5.2. Comparative analysis 

Table 3 shows the comparative performance of the different approaches based on 1294 users
accessing the system concurrently. Two performance measures are evaluated: execution time and
memory usage. First, according to Table 3, the execution time of approach V (XML with the hybrid
algorithm) is about 31% lower than that of approach I and about 13% lower than that of approach
III, the other XML-based approaches. Meanwhile, the execution time of approach VI (JSON with the
hybrid algorithm) is about 25% lower than that of approach II and about 19% lower than that of
approach IV. Approach VI therefore has the best execution time of all the approaches.

In terms of memory usage, approaches V and VI use less memory than the other approaches. They
perform better because they serve the 1294 concurrent users with less memory while still
producing the output, either positive or negative for COVID-19, based on the hybrid algorithm.
Figures 13 and 14 compare the execution time and memory usage of the six approaches. Based on
Figures 13 and 14, approach VI (jQuery + JSON + hybrid algorithm) is clearly better than the
other approaches, with the fastest execution time and the lowest memory usage.


Table 3. Time execution and memory usage
Approach  Parameters                        Time execution (ms)  Memory usage (%)
I         AJAX + XML                        0.03335              0.00960
II        AJAX + JSON                       0.01195              0.00925
III       JQuery + XML                      0.02635              0.00860
IV        JQuery + JSON                     0.01105              0.00965
V         JQuery + XML + Hybrid Algorithm   0.02305              0.00785
VI        JQuery + JSON + Hybrid Algorithm  0.00900              0.00695
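
The relative differences between approaches can be recomputed directly from the execution times in Table 3. The sketch below is only a worked calculation; the names `timeMs` and `reductionPercent` are assumptions.

```javascript
// Execution times (ms) copied from Table 3.
const timeMs = {
  I: 0.03335,
  II: 0.01195,
  III: 0.02635,
  IV: 0.01105,
  V: 0.02305,
  VI: 0.009,
};

// Percentage by which `candidate` is lower than `baseline`.
function reductionPercent(baseline, candidate) {
  return ((baseline - candidate) / baseline) * 100;
}

// XML-based approaches compared with approach V, JSON-based with approach VI.
console.log("V vs I:", reductionPercent(timeMs.I, timeMs.V).toFixed(1) + "%");
console.log("V vs III:", reductionPercent(timeMs.III, timeMs.V).toFixed(1) + "%");
console.log("VI vs II:", reductionPercent(timeMs.II, timeMs.VI).toFixed(1) + "%");
console.log("VI vs IV:", reductionPercent(timeMs.IV, timeMs.VI).toFixed(1) + "%");
```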

Figure 13. Time execution performance 

Figure 14. Memory usage performance 


6. CONCLUSION 

In conclusion, this study proposed and implemented an efficient data storage and retrieval
approach for handling a large amount of data under concurrent access. The approach using jQuery +
JSON + hybrid algorithm had the fastest execution time and the lowest memory usage, which is
significant for analysing COVID-19 online screening data based on symptoms using a keyword search
algorithm. The results indicate that jQuery + JSON + hybrid algorithm was more efficient at
storing and retrieving a large amount of data under concurrent access than AJAX + XML, AJAX +
JSON, jQuery + XML, jQuery + JSON, and jQuery + XML + hybrid algorithm. These findings are
consistent with those of previous studies and indicate that jQuery + JSON + hybrid algorithm is
the best of the compared approaches for storing and retrieving data. The hybrid algorithm
involves initialising the user input and the symptom weights, and its process also covers the
mapping, value, and total steps over the data model. The limitation of this study is its narrow
focus: it considered only a few patient cohorts. Future studies may involve many different
cohorts and a larger number of samples to provide more precise findings.